Confidence Interval for the Difference in Classification Error

Authors

William Elazmeh

Nathalie Japkowicz

Stan Matwin

Abstract

Evaluating classifiers with increased confidence can significantly impact the success of many machine learning applications. However, traditional machine learning evaluation measures fail to provide any levels of confidence in their results. In this paper, we motivate the need for confidence in classifier evaluation at a level suitable for medical studies. We draw a parallel between case-control medical studies and classification in machine learning. We propose the use of Tango’s biostatistical test to compute consistent confidence intervals on the difference in classification errors on both classes. Our experiments compare Tango’s confidence intervals to accuracy, recall, precision, and the F measure. Our results show that Tango’s test provides a statistically sound notion of confidence and is more consistent and reliable than the above measures.
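The abstract names Tango's test but does not spell out the computation. The following is a minimal sketch of a Tango-style score confidence interval for the difference between two paired proportions, not the authors' implementation. It assumes that `b` and `c` are the two off-diagonal counts of a 2x2 confusion matrix (for example false negatives and false positives), that `n` is the total number of test examples, and that the quantity of interest is `(b - c) / n`. The function names, the use of plain bisection, and the mapping of `b` and `c` to the confusion matrix are all illustrative assumptions.

```python
import math
from statistics import NormalDist


def tango_statistic(b, c, n, delta):
    """Score statistic for H0: true difference in discordant proportions = delta."""
    # Constrained maximum-likelihood estimate q of the c-cell probability
    # under H0: the larger root of A*q^2 + B*q + C = 0.
    A = 2.0 * n
    B = -b - c + (2.0 * n - b + c) * delta
    C = -c * delta * (1.0 - delta)
    disc = max(B * B - 4.0 * A * C, 0.0)  # guard against rounding below zero
    q = (math.sqrt(disc) - B) / (2.0 * A)
    num = b - c - n * delta
    var = n * (2.0 * q + delta * (1.0 - delta))
    if var <= 0.0:  # degenerate boundary, e.g. delta = -1 or delta = 1
        return 0.0 if num == 0 else math.copysign(math.inf, num)
    return num / math.sqrt(var)


def _bisect_decreasing(f, lo, hi, iters=200):
    """Locate the sign change of a decreasing function f on [lo, hi]."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def tango_interval(b, c, n, confidence=0.95):
    """Return (lower, point estimate, upper) for the difference (b - c) / n."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2.0)
    point = (b - c) / n
    # The interval is the set of delta with |statistic| <= z. The search
    # relies on the statistic decreasing as delta increases.
    lower = _bisect_decreasing(
        lambda d: tango_statistic(b, c, n, d) - z, -1.0, point)
    upper = _bisect_decreasing(
        lambda d: tango_statistic(b, c, n, d) + z, point, 1.0)
    return lower, point, upper


# Hypothetical counts: 20 errors on one class, 5 on the other, 100 examples.
lower, point, upper = tango_interval(b=20, c=5, n=100)
print(f"difference = {point:.3f}, 95% CI = [{lower:.3f}, {upper:.3f}]")
```

An interval that excludes zero suggests that the classifier's errors are unevenly distributed over the two classes at the chosen confidence level. At `delta = 0` the statistic reduces to the McNemar form `(b - c) / sqrt(b + c)`, which gives a quick sanity check.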


Related resources

Estimating Confidence Intervals for Proportions Close to Zero and One: A Secondary Modeling Study

Background and Objectives: When computing a confidence interval for a binomial proportion p, one must choose an exact interval that has a coverage probability of at least 1-α for all values of p. In this study, we compared the confidence intervals of Clopper-Pearson, Wald, Wilson, and double ArcSin transformation in terms of maintaining a constant nominal type I error. Methods: Simulations w...


A Procedure for Building Confidence Interval on the Mean of Simulation Output Data

One of the existing methods to build a confidence interval (c.i.) for the mean response in a single steady state simulation system is the batch means method. This method, compared to the other existing methods (autoregressive representation, regenerative cycles, spectrum analysis, standardized time series), is quite easy to understand and to implement and performs relatively well. However, the ...


Constructing a Confidence Interval for Quantiles of Normal Distribution, One and Two Populations

In this paper, in order to establish a confidence interval (general and shortest) for quantiles of normal distribution in the case of one population, we present a pivotal quantity that has non-central t distribution. In the case of two independent normal populations, we construct a confidence interval for the difference quantiles based on the generalized pivotal quantity and introduce ...


A survey on clinical effectiveness of orlistat compared to sibutramine, lorcaserin, metformin and placebo on weight loss in obese people: a network meta-analysis

Background: Trying to find a drug with more clinical efficacy in treating obesity is one of the priorities. The aim of this study was to evaluate the efficacy of orlistat, sibutramine, lorcaserin and metformin on weight loss in obese people. Methods: The databases of PubMed, Scopus, Google Scholar and Cochrane Library were searched up to November 2016. In the present study, the search strategy was perfo...


Invariant Empirical Bayes Confidence Interval for Mean Vector of Normal Distribution and its Generalization for Exponential Family

Based on a given Bayesian model of multivariate normal with known variance matrix we will find an empirical Bayes confidence interval for the mean vector components which have normal distribution. We will find this empirical Bayes confidence interval as a conditional form on ancillary statistic. In both cases (i.e. conditional and unconditional empirical Bayes confidence interval), the empiri...



Publication date: 2006
